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(57) Abstract 

A method for monitoring and control of an industrial or other technical process, in which the process is dependent on a multiple of 
variables (X) relevant to the process. The process comprises a description of the relevant variables as a multidimensional room, where each 
variable represents an independent component in the room, that the observations of the variable values at a certain time point represent 
a point in the multidimensional room, that observations from a number of time points form a point swarm in the room, that at least one 
first (p a ) and one second (pt>) principal direction of the point swarm are calculated, that the projections of the observations onto these first 
and second principal directions are determined, that the observations are illustrated graphically in the plane which is formed by the first 
and second principal directions in the point swarm, further that the principal directions (p a , pb) are continuously updated during the course 
of the process, the graphic information thus becoming dynamic, and, finally, that the process operator or the monitoring member, on the 
basis of the presented information, adapts members which influence variable quantities in the process such that the process is controlled to 
optimum operating conditions. 
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Method for monitoring multivariate processes 
TECHNICAL FIELD 

5 The present invention relates to method of monitoring an 
industrial process which is dependent on a large number of 

parameters, available through measured data, in a way which 
makes it possible to control the process to the desired 
conditions by allowing the relevant variables of the process 

10 to be represented by the axes in a linear space with as many 
dimensions as the number of variables, whereupon the process 
is projected onto a plane or a three-dimensional room, such 
that a calculated model of the process is obtained on-line and 
by comparing the model of the process with a reference model 

15 of the process such that a distance to the reference model is 
obtained, whereupon, when observing a drift of some parameter, 
the process can be restored to at least one norm range for* the 
process by acting upon a deviating variable . 

20 BACKGROUND ART 

For obtaining, for instance , the desired quality of a manufac- 
tured produce in a manufacturing process with the best economy 
or otherwise monitoring an industrial process or industrial 

25 application, it is necessary to control the processes as 

efficiently and optimally as possible . A manufacturing process 
includes many important variable quantities (here only 
referred to as variables ) , the values of which are affected by 
the variations of the variables during the course of the 

30 process . The optimum result is achieved if the process- 
monitoring operator or the process -monitoring member is able 
to handle and control all the process-influencing variables in 
one and the same operation. 

3 5 A conventional method of optimizing a process is to consider 

one variable at a time only, one-dimensional optimization. All 
the variables are fixed except one, whereupon the non- fixed 
variable is adjusted to an optimum result . Thereafter , the 
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free variable is fixed and one of the other variables adjus- 
ted, and so on . 

When the process variables have been set in this way one by 
5 one , it is supposed that the best working point of the process 
has been obtained. However , the fact is that this is not the 
whole truth. The process may still be far from its optimum 
working point, since the method does not take the mutual 
influence of the process variables into account * The diffi- 
0 culty of this method is to obtain a total overview of the 
process based on a number of mutually independent process 
variables as necessitated by such a view. It is only when the 
relationship between these variables can be interpreted 
correctly that the process operator gets a real overview and 
5 understanding of the process . 

An operator is limited by his or her human ability to under- 
stand and control only a limited number of variables per unit 
of time . A process monitoring system measures up to hundreds 
0 of variables , of which perhaps some 20 more or less directly 
control the process . Such a monitoring system requires a 
computer which can cont inuous ly register if and when slight 
variations occur in any of the variables. 

5 A model of a process is realized substantially by two dif f e- . 
rent types of modelling techniques , mechanistic and empirical 
modelling . Mechanistic models are used, for example , in 
physics . Data are used to discard or verify the mechanistic 
model . A good mechanistic model has the advantage of being 

0 based on established theories and is usually very reliable 
over a wide range . However , the mechanistic model has its 
limitations and is only applicable for relatively small , 
simple systems , whereas it is insufficient, if even possible 
to use , for building an axiom around a complex industrial 

5 process . Many attempts have been made to model processes with 
the aid of mechanistic models based on differential equations. 
An important disadvantage of these models , however , is that 
they are greatly dependent on the dependence of certain 



95 1 8420A2 



> 



WO 95/18420 



PCT/SE94/01208 



3 

parameters on each other. Such parameters with great depen- 
dence on each other must be determined for the model to 
function. In the majority of cases it is very difficult to 
quantify them in a reliable manner . A consequence of this is 
that it is very difficult to obtain mechanistic models that 
work in practice . 

In empirical modelling the model is based on real data, which, 
of course, requires good-quality data . Process data consist of 
many different measured values . In other words , process data 
are multivariate , which presupposes multivariate techniques 
for process data to be modelled and illustrated. Different 
statistical methods exist for multivariate modelling . 
Traditional multivariate modelling technique , as for example 
linear regression (MLR) , assume independent and error-free 
data. For that reason, such technique cannot handle process 
data , since they are highly interdependent and , in addition, 
influenced by noise . 

A solution to the above problems is to use projection 
technique . This technique is capable of selecting the actual 
variation in data and expressing this information in so-called 
latent (underlying) variables . The technique is described in 
ABE Review 4/93, Bert Skagerberg , Lasse Sundin . The projection 
technique is most advantageous for obtaining a fast overview 
of a complex process . The two projection techniques , PCA and 
PLS , that is , Principal Component Analysis and Projection to 
Latent Structures , are tailor-made for solving problems such 
as process overview and identification of relationships 
between different process variables . 

Models created with these two methods can be executed directly 
(on-line) in the process information system and can be used 
for process monitoring. PLS is highly suitable for predicting 
various quality-related variables , which are normally diffi- 
cult to measure or sometimes even impossible to measure 
routinely since they occur late in the process . 
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Modelling by means of projection techniques (PCA, PLS ) is best 
explained by the use of simple geometry in the form of points, 
lines and planes. Process data are usually listed in the form 
of tables, wherein a row represents a set of observations , 
5 that is , registration of variable values , in the process at a 
certain time . For practical reasons , and for the sake of 
clarity, the description will be restricted in the following 
to a data table with three variables , that is, three columns , 
which can be illustrated geometrically with the aid of a 

10 three -dimensional coordinate system (Fig . 1} , where the 

variables in the process are represented by the axes in the 
coordinate system. However , the method functions for an 
arbitrary number of variables, K, where K > 3 , e.g. K = 50 or 
K = 497 . An observation of the relevant variables in the 

15 process at a certain time may here be represented by a point 
in the coordinate system which is common to all variables , 
which means that the measured value of each variable corre- 
sponds to a coordinate for the respective axis . Mathemati- 
cally , independently of the number of coordinates , a row in 

20 the table still corresponds to a point . All n rows in the 
table then correspond to a swarm of points (Fig . 2 ) . The 
mathematical procedure for describing a process with K 
relevant variables is handled in the same way by the observa- 
tions at each time being represented by a point in a multi- 

25 dimensional room with K coordinates . 

The pro j ection method works on the assumption that two points 
that lie close together are also closely related in the 
process , 

30 

The data set may now be projected to latent variables in a 
series of simple geometrical operations as follows : 

- The midpoint in the data set is calculated. This calculated 
3 5 point is called x. The midpoint coordinates correspond to the 

mean value of all the variables in the system (Fig. 3 ) , 

- Starting from the midpoint x, a first straight line, pi , is 
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drawn, which is adapted to the data set such that the distance 
to the line for the individual points is as small as possible . 
This line corresponds to the direction in the data set which 
explains the greatest variation in the process , that is , the 
5 dominating direction in the data set and is referred to as the 
first principal direction. The direction coefficient of this 
line is combined in the loading vector pi . Each point in the 

data set is then projected orthogonally to this line . The 
coordinates from the projection of all the points to the line 
10 form a new vector ti . {Each point gives a value , here called 
"score M , as a component in the vector ti J 

- The new vector (t i } is usually called score vector and 
describes the first latent variable . This latent variable 

15 expresses the most important direction in the data set and is 
a linear combination of all three variables (or in a multi- 
dimensional system all K variables involved) . Each variable 
has an influence on the latent variable which is proportional 
to the size of the direction coefficient in the loading vector 

20 pi - 

- Even if the line, the first principal direction, Pi, given 
by the loading vector , pi , according to the above is one that 

most closely agrees to the data set , it can still be seen from 
25 Figure 4 that the deviations from the line are relatively 

large . A second line, p2 , may be adapted to the point swarm 
which represents data in the process . This second line , p2 , is 
orthogonal to the first line , pi , and describes the next most 
important direction in the point swarm (Figure 4} and is 
30 referred to as the second principal direction. The score 

vector t2 and the direction coefficient P2 are interpreted 
analogously to ti and pi . 

Analogously, a third projection line can be constructed with 
35 the direction P3 and the score vector t3 . However , the value 
of computing a third principal component in this three- 
dimensional example is limited, since the resulting three 
latent variables ti , t2 and t3 only represent a rotated 
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version of the three-dimensional coordinate system. 

If, instead, a look is taken at the projection plane which is 
defined by the first two principal directions, pi and P2 , it 
5 can be determined that this plane describes the point swarm 
well in two dimensions only . The advantage of this is that 
points projected onto a plane reproduce information which 
emanates from variables in three dimensions. This is one of 
the reasons for using PC A to analyze a complex data structure . 

10 From a number of variables a small number of underlying latent 
variables may be obtained, these latent variables describing 
the main part of the systematic information about current 
process data . From experience, it has proved that more than 2- 
6 latent variables are not required. This can also be shown 

15 theoretically . The latent variables provide an overview of the 
data set and can be presented in the form of different types 
of diagrams or gr aphi c images . Part of the variation of the 
data set will remain after the latent variables have been 
extracted and are called residuals (deviations ) . These contain 

2 0 no systematic information and may therefore be regarded as 

superfluous and are often referred to as noise . 

According to one approach, the proj ection plane , defined by 
the lines pi and p2 , may be seen as a two-dimensional window 

25 into the mul t idimens i onal { in the example the three- 
dimensional) world. The basic idea behind PCA is to construct 
such a projection window , providing the viewer with a picture 
of the multidimensional data set. Consequently, PCA ensures 
the best possible window, that which contains the optimum 

30 picture of the data set . Further, the window can be saved and 
displayed graphically . The projection window visualized on a 
computer screen provides an operator, for example , with an 
overview of a complex process . 

3 5 The projections described above are essentially a geometrical 

interpretation of the principal component analyses which have 
proved to be very suitable for obtaining an overview of 
process data. Normally, it is sufficient also among hundreds 
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of variables to calculate about three principal components to 
describe the principal information in the data set . Typical of 
the PCA method , when applied to process data , is that the 
system easily selects a strong first component , a less impor- 
5 tant second component , and a third component describing little 
but systematic information. 

The PCA method is suitable to use for analyzing blocks of 
process data. Questions which may be answered in an industrial 
10 process by means of PCA are : 

- Overview of a quantity of data 

- Classification {e.g. if the process continues normally or if 
it deviates) 

15 - Real-time monitoring (e.g. to track the conditions 
and discover an incipient deviation as early as possible ) . 

Another important problem is to identify relationships between 
process data, X , and more quality-related data, Y. This type 
20 of relationships are difficult to analyze, if even possible 
using traditional modelling techniques , since the relation- 
ships are often hidden in complex interactions and correlation 
patterns involving different process variables . 

25 Projection to latent structures , PLS , is a projection 

technique which offers a method of modelling complex relation- 
ships in a process . PLS decomposes two blocks of data, X and 
Y, into principal components as projections (Fig. 5) . The two 
modelled blocks are similar to the solution according to the 

30 PCA method, but differ in that in PLS the projection is made 

to explain X and Y s imul t aneous ly for the purpose of obtaining 
the best possible correlation between X and Y . Thus , the PLS 
method serves to model the X block in such a way that a model 
is obtained which in the best way predicts the Y block. A PLS 

3 5 model can thus be very useful for predicting quality -related 
parameters , which are otherwise both expensive and difficult 
to measure . instead of having to wait perhaps a week before a 
critical value from the quality control laboratory becomes 
available , this value can be immediately predicted in a model . 
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Figure 6 illustrates an example of how the study and monito- 
ring of an industrial process can be visualized by means of a 
computer screen on-line . The left half of the figure shows a 
score plot , that is, a representation of the observations of 
5 the measured data of the process from two latent variables t a 
and tb reproduced with two principal directions p a and pb as 
axes in the coordinate system of the graph . The left half of 
the picture shows both a static and a movable picture . The 
static picture consists of points which describe the variation 

10 in the reference data which are used for building the model . 
If these reference data are chosen in the best way, the 
picture consists of good working ranges for the process as 
well as ranges which should be avoided in the process . The 
picture may be compared to a map containing information as to 

15 whi ch conditions the operator should strive to direct the 
process to , and which conditions should be avoided. 

On-line execution of measured process data results in calcula- 
ted markings , that is , that observations made at a new time 

20 are reproduced as a new point in the plane which is represen- 
ted by that plane which, in the form of the two selected 
principal directions p a and pb/ constitute the coordinates of 
the graph on the screen. This means that each new point 
contains information about ail the relevant measured data 

25 because of the projection to the latent variables according to 
the PLS method. Changes in the process may then be reproduced 
on-line on the screen in the form of a line in the left half 
of the VDU. The changes are reproduced with the aid of a 
movable figure in the form of a curve which connects the 

30 observations at different times . The curve will thus move in 
time over the screen like a crawling " snake " . To make the 
operator better understand the significance of the information 
provided by the crawling snake, the snake may be divided into 
a head and a tail , which are also illustrated in different 

35 colours and symbols . The head consists of present observa- 
tions , whereas the tail is built up of "historical " observa- 
tions . If an alarm is raised, that is, when the curve (the 
snake) detects "prohibited" areas , the snake may change 
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colour, for example to red . 

The movable curve is an aid to the operator to continuously 
monitor the status of the process by viewing the process 
5 through a "window u on the screen into the multivariate rooms 
of the process . The location of the snake's head is compared 
with the area where reference data of high quality have been 
attained. The ambition of the operator or the monitoring 
member of the process should be to control the process to this 
10 area . 

To the right in Figure 6 there is shown an example of other 
information which may be imparted to the operator via the 
screen with the aid of the PLS method. The right picture is a 

15 reproduction of loading vectors , a loading plot . This is a map 
of how the score plot , that is , the curve in the left picture, 
is influenced by the individual variables in the process . The 
left and right picture halves also contain associated informa- 
tion. This means that the direction in the left picture has a 

20 direct correspondence in the right picture . The operator may 
receive guidance from the right picture if he/she is to con- 
trol individual process variables for the purpose of moving 
•the process (the "snake" ) to achieve better operating 
conditions for the process . 

25 

The use of the method described above means a powerful instru- 
ment in monitoring processes which are dependent on a large 
quantity of process variables in a simple and clear way . As 
examples of technical fields , within which process monitoring 

3 0 of industrial processes according to the described methods may 
advantageous ly be utilized, may be mentioned the pulp, paper, 
chemical , food, pharmaceutical , cement and petrochemical 
industries as well as power generation, power and heat distri- 
bution, and a wide range of other applications . However , the 

3 5 PCA and PLS methods , respectively , used according to the prior 
art suffer from a weakness in that the projection plane which 
is built up of two principal directions, and to which plane 
the observations are projected, are fixed and do not change 
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during the course of the process . This means that changes in 
the swarm of points in the multidimensional space, which has 
constituted the base of the calculation of the principal 
directions p a and pfc>, are not taken into account. At the same 
5 time new observation series are constantly added during the 
process, in which variable values may be changed, which means 
that the geometry of the point swarm in space may be changed 
and that the calculated principal directions which are inten- 
ded to reflect the shape of the point swarm are no longer of 
10 interest . This is not reflected by the graphically reproduced 
information about the course of the process according to the 
above . 

SUMMARY OF THE INVENTION 

15 

The present invention relates to a method for monitoring and 
control of an industrial or other technical process , in which 
the course of the process is dependent on a multiple of varia- 
bles relevant to the process. The method involves a descrip- 

20 tion of the relevant variables as a multidimensional room, 

wherein each variable represents an independent component in 
the room, that the observations of the variable values at a 
certain time represent a point in the multidimensional room, 
that the observations from a number of times form a point 

25 swarm in the room, that at least one first and one second 

principal direction of the point swarm are calculated, that 
the projections of the observations on these first and second 
principal directions are determined, that the observations are 
illustrated graphically in the plane which is formed by the 

30 first and second principal directions in the point swarm , 

further that the principal directions are continuous ly updated 
during the course of the process , whereby the graphic informa- 
tion becomes dynamic , and, finally, that the operator or the 
monitoring member of the process , based on the presented 

3 5 information, adapts members which influence variable quanti- 
ties in the process such that the process is controlled to 
optimum operating conditions . 
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According to the prior art, information about the course of 
the process is obtained by projecting measured data onto a 
plane which is comprised in the variable space which describes 
the process . The novel feature according to the invention is 
5 that the plane to which measured data are projected, according 
to the PCA and PLS methods , dynamically follows the flow of 
new series of measured process data, the proj ection plane 
being able to rotate in the multivariate room which describes 
the process . This provides a constant monitoring of the 
10 process in relation to the present stage and not , as 

previously, in relation to a "process historical " stage , 

The illustration of the course of the process may take place 
according to previously known technique, on-line or off-line 
15 in the form of a snake which crawls on a screen according to 

the above , or in the form of ordinary historical trend curves* 

When the process is visualized with the aid of a snake 
crawling over a plane f this means according to the invention 

20 that the direction of crawling of the snake illust rates a 
direction of the process taking into consideration how 
variable quantities temporarily influence the mode jl ox the 
process , in that variable values which slide away in different 
directions in the process influence the above-mentioned point 

2 5 swarm to assume new geometries . 

When showing the course of the process as a graph on a screen, 
reference data for the process are also plotted on the screen 
in the form of regions to which the process should be control- 
30 led . Also proc ess - inf luencing parameters are plotted on the 
screen to indicate which variables in the process have a 
strong influence on the process when the process slides in a 
certain direction indicated by the direction of movement of 
the graph on the screen. 

35 

If the process is on its way into prohibited or non- optimum 
regions, which is indicated on the screen by the graph moving 
into regions which are marked on the screen as forbidden, the 
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operator controls the process towards allowed regions by acti- 
vely influencing at least one member in the process which 
influences the variable or variables which is or are denoted 
by the graph as being capable of being influenced by the 
5 member or members which restore the process to the norm or 
reference region plotted on the screen. 

It is also possible, if desired, to automate the monitoring, 
by using known technique , by sensing which variable or 
10 variables can restore the process to the above-mentioned 
desired regions with known electronic devices , which then 
control the variable- influencing members in the process such 
that the process is maintained within given frames . 

15 According to the invention, new current models of the process 
are calculated dynamically . By continuously comparing the last 
calculated model with a reference model determined for the 
process , a real-time calculated value of the distance of the 
process from the reference model is obtained . When this dis- 

20 tance exceeds a value fixed for the process , it is practical 

to initiate an alarm. A variant of this alarm is arranged such 
that the most interesting part of the process , indicated as a 
graph in the form of a crawling snake on a screen, when this 
graph enters forbidden regions for the process , is coloured, 

25 for example , red. Other devices for calling attention 

requiring action are also to raise an alarm, for example, by 
means of a signal , a light , a lamp, etc . 

In another variant of the invention, a third principal direc- 
3 0 tion for the point swarm in the variable space is calculated, 
whereupon the observations are projected to the three-dimen- 
sional room which is defined and spanned by the three princi- 
pal directions , and that the three principal directions accor- 
ding to the invention are continuously updated during the 
3 5 course of the process , and that the observations are illustra- 
ted graphically on-line as projections in the room spanned by 
the three principal directions , which also in this case may 
take place by a graph in the form of a snake crawling between 
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the coordinates in the room to which the current observations 
of the process are projected. 

Applying the described method, process automation is given a 
5 very powerful instrument for monitoring and controlling, in a 
well-arranged manner, also very complex processes . 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 shows how collected data can be represented in a 
coordinate system with as many dimensions as the number of 
variables. An observation of the process represented as a row 
in a matrix. with variable values gives rise to a point in the 
coordinate system. 

Figure 2 shows a swarm of points, each one representing an 
observation of the process , in the coordinate system . 

Figure 3 shows how a first principal direction of the point 
20 swarm is formed. 

Figure 4 shows how a second principal direction of the point 
swarm is formed . 

25 Figure 5 illustrates how the PLS method models and identifies 
dependencies between two data sets , for example measured 
process data and quality-related data, which makes possible an 
immediate prediction of the occurrence of the process . 

3 0 Figure 6 illustrates in the left picture a score plot, which 
shows the state of the current process with the aid of a so- 
called "snake" which follows the course of the process, 
whereas the right picture shows a loading plot which, in turn, 
indicates how the process is influenced by process variables 

35 (TOT, FAR , PKR, etc . ) introduced into the coordinate system. 

Figure 7 illustrates the weighting of the observations in the 
monitored process in a long-term memory and a short-term 
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memory , respectively, according to the method. 

Figure 8 shows the utilization of control limits in the form 
of limits to the standard deviation from the mean value of the 
5 process , which limits may be used in a monitoring system to 
justify intervention into the process . 

Figure 9 explains how the projection plane, onto which all the 
observations are projected according to the invention, under 
0 certain circumstances may be subjected to an unintentional 
rotation of the model . 

Figure 10 denotes the exponential weights v in the data block 
and the loading block in the EWM-PCA algorithm. 

Figure 11 shows a so-called "distance-to-model " curve or DCL 
(Distance -To-Class) curve, which has been obtained explicitly 
for presentation of multivariate processes . The DCL curve 
describes the distances to the limits of the reference model . 
0 A multivariate alarm is defined depending on the level of the 
DCL (Dmod) . 

Figure 12 shows a schematic flow chart of the calculation 
steps in the calculating units which carry out the calcula- 
5 tions to obtain the model of the process as well as the dis- 
tance to the reference model . 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

0 According to the invention, modelling by means of PC A and PLS 
is used by dynamic updating of the process model by means of 
exponentially weighted observations and is described as multi- 
variate generalizations of the exponentially weighted moving 
average , abbreviated EWMA (Exponentially Weighted Moving 

5 Average) . 

Principles and determination algorithms for utilizing EWMA and 
realizing dynamic models, which according to the invention 
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make it possible to obtain optimum monitoring of a process, 
are presented in the following . Further , predicted control 
charts based on these models are shown . 

5 Standard PCA and PLS models assume an independence of process 
times , that is, that no process memory is utilized. Since 
projected observations (scores ) by means of PCA and PLS entail 
good " cross sections " of process data, a natural way to model 
" memory effects " would be to develop simple time-series models 

10 in these scores . One of the simplest models is available via 

EWMA, which provides both a good picture of the current status 
in a process and a "one- step-ahead" forecast about the 
process . In this way, an EMMA model based on multivariate 
pro j ected observation (scores) from PCA and PLS constitutes a 

15 natural extension of multivariate model standards for process 
applications . 

A generalization of EWMA into EWM-PCA and EWM-PLS consists of 
two parts . The first part is related to the use of scores 
20 instead of individual variables in control charts and predic- 
tions . The second part is the dynamic updating of the PCA and 
PLS models to allow the model to take into account the drift 
in the process . 

25 The obvious field of application of EWMA -PCA /PLS is process 

monitoring and control . Multiple responses are common in all 
types of automatic process control today , both because it is 
simple and inexpensive to measure many process -influencing 
quantities and because complicated products /controls impose 

30 many demands on criteria which must be monitored and controll- 
ed to ensure high quality of the product /control . As an 
example to illustrate EWMA- PCA in this disclosure , we use a 
(49x17 ) matrix with collected measured data of 17 variables 
from a paper machine over a period of time of 49 equal time 

35 intervals . The 17 variables comprise values from measured 

quantities such as the weight of the paper pulp, the moisture 
content ; the breaking stress of the paper , the velocity of the 
machine, etc . The method of utilizing EWMA offers , per se, 
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also entirely different possibilities within fields which 
attract increasing interest, such as pollution of rivers, 
lakes, oceans, etc. When monitoring such pollution, a plura- 
lity of variables are measured where the method according to 
5 the invention would offer a clear and well-arranged way of 
presenting data. In the case of, for example, emission of 
substances /particles from an industry into a reception area, 
such monitoring would permit feedback of presented measured 
values and permit control of a change of components in the 
10 emissions . 

In, for example, chemical/ technical contexts , other sequences 
than changes are often studied over time . Natural polymers, 
such as cellulose, DNA and proteins are built up of sequences 
15 of a set of monomers , wherein local monomer ic EWMA- PCA 

properties can be used to obtain information about such things 
as binding sites , etc . In such applications , it may be a 
natural thing to extend the exponentially decreasing weighs in 
both directions from the centre of the model . 

in the following description of the model, to achieve the 
method according to the invention, designations according to 
the following table are used . 

25 X a matrix with process variables (entries to predict Y) 
Y a matrix with "result " variables in PLS (responses , 

output values , product properties ) 
i , j index of observations , rows in X and Y; ( i , j = 1 , 2 , . 

N) 

30 N number of elements , observations , samplings , or process 

times ; (rows in X and Y) 
k variable index in X and Y; ( k = 1 , 2 , . . . , K) 
K number of variables in X or Y; (columns in X or Y) 
* used for designating a memory matrix for old values 

3 5 m index of response variables ; (m - 1, 2 , . . . , M) 

M number of PLS Y variables; (columns in Y in PLS model) 

vi weight of the observation i 

a component index; (a = 1, 2, . . . , A) 
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A number of components in the model 

W matrix of PLS weights (dimension K x A) 

w a columns in W, X weights of component a 

P loading matrix, dimension (K x A) 

5 Q memory matrix of loadings (or PLS weights) 

C matrix of PLS Y weights , dimension (M x A) 

c a columns in C, Y weights of component a 

T score matrix of X, dimension (N x A) 

t a columns in matrix T , scores of component a 

10 U matrix of u-scores , dimension (N x A) 

u a columns in matrix U, second scores of component a 

E a X or Y residuals after component a, dimension (N x K) 

F a PLS Y residuals after component a, dimension (N x M) 

15 EWMA may be regarded as a model with two components . The first 
component concerns the creation of a modelling variable y and 
predicting this variable y at a subsequent point in time . The 
second component concerns the arrangement of a control chart 
based on the model . 

20 

The basic idea behind EWMA is to model y as a weighted moving 
average, with the latest observations weighted heavier than 
earlier observations. Exponential weights 

25 vi = Xd-X)^ 1 (1) 

are used for the i'th observation which precedes the current 
one (i=t) , see Figure 7 . This gives the predicted value at the 
time t+1 according to equations (2 5 and (3) . These equations 
30 may at the same time be utilized to recursively update the 
EWMA model from time t to time t+1 according to : 

yt+i = *Yt + d->Uyt (2) 

35 = y t + My t ~ Yt) = Yt + ^ e t (3) 

Assuming that the residuals , e± , have a constant variance a 2 , 
the variance of EWMA will be : 
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Var(EWMA) = or 2 X/ (2-\) (4) 

A corresponding standard deviation (SD) may be used for 
creating control limits as, for example, three-sigma limits. 
5 Thus, the EWMA diagram can be used as a monitoring instrument 
for indicating if the process is significant at the side of 
the desired region to thereby justify an intervention. See 
Figure 8. Since, on the other hand, the model provides us with 
a prediction of y at the next observation time, EWMA may also 
10 be used as a base for modifying the difference between the 

prediction and the score value, that is , an achieved dynamic 
process control . For this purpose , a modified EWMA is 
recommended as follows : 

15 EWMA = y t + l = Yt + ^l e t + ^2 X e t + X 3 (e t - e t _i) (5) 

The values of the parameters X± to X3 are estimated from the 
process history . 

20 The principal component analysis, PC A, is usually based on an 
analysis of an (N x K) data matrix, Y, which starts with a 
matrix, centred and scaled into uniform column variance. PCA 
models this normalized matrix as a product of an (N x A) score 
matrix T, and an (A x K) loading matrix, P ' , as well as an (N 

2 5 x K) residual matrix, E . The number of product terms, the 

components A, define the dimensionality of the PC model . If 
the number of product terms, A, is equal to, or greater than, 
the dimension of X, N or K, the residuals E are identically 
equal to zero . The number of significant components , A, may be 

3 0 estimated in a plurality of ways ; here we advocate cross- 

validation. 

Y = X a t a *Pa' *E = TP ? -i-E * 6) 

3 5 The scores (the columns in T) are orthogonal and in many ways 
provide the best summary of data. This provides a good picture 
of the process if these scores are plotted into a diagram as 
dependent on time. 
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For an unweighted calculation of the principal components, t a 
and p a . division into singularity values (SVD) is a method to 
prefer if all the components are desired (a = 1,2, . . . , 
min (N,K) ) . If only a small number of first principal compo- 
nents are of interest , a method known under the name NIPALS 
(see , e.g. , H. Wold, Nonlinear estimation by Iterative Least 
Squares Procedures, Research Papers in Statistics, Wiley, New 
York 1966 ) may be applied as this method is faster since only 
the first components mentioned are determined . The NIPALS 
interpretation of the loading values (Pak) as partial 
regression coefficients makes the calculation of PC models 
uncomplicated, as is shown below. 

E a -1 = * e ik,a~i) = X - X b a ~ 1 t b *p b ' (7) 

e ik,a-l = t ia * Pak + e ik (8) 
Pak = ^i N ( e ik * t ia } / (t ia * t ia ) (9) 



The elements in pa are normally normalized to the unit length 

l| 

( p a j j = 1), which gives 

tia = £k YikPak (10) 

The standard deviation ( SD) of the row i of the residuals, s± 
is a measurement of the distance between the i'th observation 
vector and the PCA model . For this reason, this standard 
deviation, DMod , is often referred to as the distance to the 
model • 

To develop an exponentially weighted moving principal com- 
ponent analysis iEWMA) , which is utilized according to the 
invention, two steps are required . The first step, which com- 
prises updating and prediction (forecasting) of the process 
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values at the next point in time t + 1, is uncomplicated if an 
existing PCA model for the process is assumed. The second 
step, updating this PCA model for a process which is driving , 
proves to be complex . 

5 

The forecasting part is achieved by means of K multivariate 
process responses Y = {yi, y2 # Yin* YM> * and a PCA 

model with A components which is determined from these Y data. 
A process time point has A scores , tia (a=l, 2 , . A) , 
10 associated with it, which form a row in the score matrix T . 

Now, assuming a certain auto -regressive auto- correlation 
structure and a stable cross-section correlation structure in 
the data set, and thus a stable PCA model , the EWMA values in 
15 the scores t a will provide us with a base for multivariate and 
dynamic process control . 

Here we assume that the process is driven by only A indepen- 
dent "latent variables" , which indirectly are "observed" by 

20 the Y variables and determined as scores t a (a = 1, 2 , . 

A) „ This gives two alternatives for achieving control charts . 
Either one control chart may be maintained for each PC compo- 
nent, a, which is justified if the components have a separate 
physical meaning. An additional control chart may be construe- 

25 ted from the residuals of the standard deviation, the DMod 
cable. A second alternative is obtained by combining all the 
significant t and DMod into one single table, which, however , 
leads to a loss of information about the separate model 
dimensions „ 



30 



3 5 



The prediction about the score vector t {with A elements ) at 
the time t+1 is analogous to the equations (2 ) and (3) accor 
ding to : 

*t + l = Xt t + (l-X)fet 



t + 



A(t r - 6t)t {12} 
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The more elaborate form analogous to equation (5) is obvious . 
These thus predicted scores forecast the vector y of M 
variables according to : 

5 yt+1 = £t+l p' (13) 

The variance of £t+l,a is directly given by equation (4) with 
G a 2 determined by means of scores from a long series of 
"historical" data in the process . Because of a non-full rank 
10 of the matrix Y, classical variances of yt+1 cannot be 

determined without additional assumptions . If "partial least- 
squares assumptions " are made about some independent regula- 
rity of each y&, an acceptable variance of the forecasted 
vector yk would be : 



15 



30 



Var(yk / t*fl = £a Pka 2 ®ol 2 < 14 ) 



In the model according to the invention, an updated dynami- 
cally exponentially weighted PCA is further required, a way of 
20 handling the risk of rotation in the model , which will be dis- 
cussed below, as well as closer definition of centering and 
scaling . These questions will be dealt with one at a time . 

To achieve a weighed PCA, we are using exponent ial ly decree- 
25 sing observation weights , vi, according to equation ( 1 ) , 

whereby, with the aid of the weighted least-squares formulas 
and equation (9 } , the following is directly obtained: 



Pak 



Xi N (vi * e ik * t ia ) / Xi N (vi * ti a * t ia ) (15) 



Consequently, the NIPALS algorithm can be utilized directly 
with only minor modifications when calculating EWM-PC 
loadings by using the exponent i a 1 ly decreasing weights , vi, 
according to equation (1) . For a single, fixed Y, the other 
35 NIPALS steps remain unchanged . The unweighted scores, t a > 
however , are no longer orthogonal , whereas the weighted 
t iaVvi are orthogonal . 
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Prior to a multivariate modelling, data are usually centered 
by subtraction of the column mean values from the data matrix. 
The mean vector may be interpreted as a first loading vector, 
PO , with a corresponding score vector, to* which has each 
5 element equal to 1/N. 

in the present application, there are two natural ways to 
proceed for determining a centering vector. In one case, a 
constant mean value vector is used, determined from a long 
10 process history . In the second case, EWMA is used for each 

variable, (y^) , with a much smaller X than what is used in the 

EWMA-PCA weights (vi) . To stabilize the estimation of this 
EWMAk, this is calculated by using the residuals of the PC 
model instead of (normalized) raw data . Thus , the observation 
15 vector yt + 1 is centered and scaled by using the parameters at 
time t. Then, the predicted values are subtracted {by means of 
equation (13 ) ) , to give the residuals et+l> which are used to 
update EWMA^ in accordance with the equations (1) and (2). 

20 After the centering, data are scaled by multiplying each 

column in the data set by a scalar weight <t> k - By means of 
variance scaling (autoscaling) , <3> k is calculated as 1/sk, 
where s& is the standard deviation for columns . This again 
leads to two obvious choices ; to calculate sk from a long 

25 process history or to use an updated computation of sk based 
on weighted local data. A third option is based on a slowly 
updated " spanning " database , which is described below. 
Important variables may be scaled up by thereafter multiplying 
#k by a value between 1 and 3 and inversely ♦ Other less 

30 important variables may in a corresponding way be scaled down. 

The above-mentioned rotation problem, which may arise when 
that point swarm of observations in space, which in the model 
is projected onto first and second principal directions, more 
3 5 or less has a circular propagation. In such situations, each 
bilinear model, both PCA and PLS, is partially undefined with 
respect to rotation. See Figure 9 . In dynamically updated 
models , this leads to a potential instability; when a new 
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process observation is introduced in the model, this may lead 
to a rotation of the immediately preceding model , even if the 
new observation point lies very close to the model plane . This 
manifests itself as a jump in the score plot shown, which is 
5 incorrectly interpreted as a change of the process itself . To 
avoid this unintentional rotation, loading vectors from the 
preceding model are saved in an auxiliary matrix, a " P -memory M 
matrix, { " W -memory " in PLS) , here designated Q. Thereafter, 
when estimating the updated model, this P -memory matrix, 

10 exponentially weighted, is included according to the multi- 
biock PCA/PLS algorithm published in "MULDAST NEWS " , Report 
from the MULDAST symposium in Ume&, 4-8 June, 19 84, S. Wold et 
al. This can be seen as a Bayesean estimation of the PC model, 
where information from previous events is stored in Q, the P~ 

15 memory matrix . See Figure 10 . 

The consequence of including a memory matrix is that the 
updated loading vectors p a * (or w a in PLS) are forced not to 
differ too much from the preceding loading vectors . The 

20 balance between new and old values is checked by an adjustable 
parameter, a. The full algorithm is given below. 

A further difficulty to take into consideration is that , when 
losing the memory during an instability period, each recursive 

25 model estimation has a tendency to lose the information about 
previous periods . This is due to the fact that if the process 
is stable sufficiently long, only data without appreciable 
variation are retained and earlier data are weighted down and 
will have insignificant influence in the exponentially 

30 decreasing weights . 

To force the model to remember important events further back 
in its history, a second auxiliary matrix is also used, a 
reference data matrix, Y*, in the model determination. This 
3 5 matrix contains those observations (points) in the process 
which span all the space of previous observations and which 
are updated whenever a new process observation has a score 
(t a 5 which exceeds a certain fixed limit value . These limit 
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values may be derived from "historical" data, that is , 
previously occurring extreme values , or be preset by the 
process operator. By analogy with this, an additional 
reference matrix for the loading vectors , Q* , is involved such 
5 that the process memory regarding the loading vectors does not 
disappear during some period of instability . 

As in the P -memory matrix (Q) , the rows in the reference 
matrices (Y* and Q* ) are exponentially weighted, but with a 
10 slower decrease by the use of a smaller value y which is used 
instead of the greater X in equation (1) . 

Many processes now and then generate "spikes " , that is , devia- 
ting values which should not be included in the modelling 

15 work. The simplest way of handling these spikes is to cal- 
culate the distance in the Y-space between a new observation 
and the preceding one . Observations with widely differing 
values, which create scores (t a ) far beyond the "norm ranges " 
compared with the values of reference data according to the 

20 above , are discarded after a message to the operator, unless 
several consecutive process observations demonstrate a 
consistently deviating pattern. 

Applying the method according to the invention (EWM-PCA or 
25 PLS ) to a set of historical data with a given set of parameter 
values Ai to X3 gives predicted errors of one- step- ahead 

forecasts for each score, t a , and for each y-variable. The sum 
of the squares of the differences between actual values and 
predicted values thus forms an estimation of predictive power 
3 0 of the model in the same way as with cross-validation. This 

sum, PRESS, has components from each score, or y-variable, or 
both, weighted according to their perceived importance. To 
find the best combination of the values of X± to X3 , - Response 

Surface Modelling" (RSM) is recommended. In this approach, 15 
3 5 models with different parameters are evaluated in parallel . 
The 15 parameter combinations ( j=l, 2 , . . . , 15 ) are selected 
according to a "Central Composite Inscribed" (CCD design with 
low and high values being around, for example, 0 . 15 and 0.45. 
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This is then followed by a regression of y=log (PRESS j ) 
against the extended design matrix X=X, which gives a predic- 
ted combination of parameter values which provides a minimum 
of PRESS . 

5 

A step-by-step overview of the process model according to the 
invention will be presented in the following: 

1. Select parameters X± to A3 in equation (5) {or egs . 11, 

10 12) in the simplest case. This is done based on 

experience or estimation of values which give the best 
predictions in a longer process history, 

2. Select a starting matrix, Yq , in accordance with process 
15 data at the beginning of the time interval of interest . 

If PLS modelling is used, the two starting matries Xq and 
Yq are needed. From these, column mean values and stan- 
dard deviation are calculated for centering and scaling 
of data. 

20 

3. Use weighted PCA (or PLS) to derive an initial model of 
the process from normalized data according to step 2. 

4 . Initiate the data memory matrix by including the data 

2 5 from Yq in PCA and Xq in PLS which correspond to the 

maximum and minimum score values of each model dimension, 
a • 

5. Initiate the loading or weighting memory matrices , Q a , 

3 0 Cement or w mem ) , one for each component, a, with p a ' or 

w a * as the first and single rows . 

6. Initiate Q*, the long-term spanning P a or W a matrices, 
identical with those in step 5 . 

1. Make a one-step-ahead forecast of scores t a ,t+l- Then 

calculate predicted y values from t a# t + l and P " (PCA) or 

C (PLS) . 
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8 . Fetch the observed values yt + 1 (and xt+l for PLS! . 

Investigate whether they contain spikes . Center and scale 
them by using normalization parameters from the previous 
step (time=t) and calculate the current scores t a ,t+l 
and the remaining residuals e = Yt + 1 ~ fc a, fc + l^t ' • 

9 . Update the centering parameters by means of the residuals 
e . 

10. Update the EWM-PC or PLS model by iterating the algorithm 
to convergence . 

11. Update the memory matrices Q a * (^mem, a or W me m ? a for 
PLS } , and, if justified, also the data memory matrix Y 
and the matrices P, W, Q* a and the memory matrix Y* . 

The difference between EWM-PCA and EWM-PLS may be described 
such that, in the 51 PLS 88 situation, the process data have been 
divided into two (or more) blocks; X referring to input data 
and Y referring to "output data' 5 , that is, a performance and 
quality assessment of the product. It may here be desired to 
monitor and forecast the process (X) , the result ( Y) , or both. 
The algorithm for updating the model becomes somewhat more 
complicated by the inclusion of the Y block. The data memory 
will also have a Y block. Forecasts of Y are made directly 
from the forecasted X scores (t) , as shown in step 7 above, 
and X data in the same way as with EWM-PCA. The inclusion of 
the Y block stabilizes the model and reduces the constraints 
on P. 

One of the most determining advantages of the process model 
according to the invention is that it becomes possible to 
follow the course of the process dynamically with the aid of a 
display, wherein first scores (ti and t2 ) are plotted against 
each other or separately versus time. Such a representation 
gives a good picture of how the process is developed. The 
distance to the model, Dmod, that is, the standard deviation 
of the Y residuals (X residuals for PLS) may be included as a 
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separate reproduction or be included in the score reproduction 
as colour in dependence on the distance mentioned. See Figure 

•J— -J- « 

5 Tests with process monitoring according to the model have been 
carried out experimentally , inter alia on ore treatment , which 
has allowed the process to be monitored and clearly shown when 
the process does not lie within the normal framework. 

0 Further, in summary , it can be said that the present model 

with EWM-PCA and EWM-PLS provides us with multivariate windows 
on a dynamic process , wherein the dominating properties of the 
development in the process are shown as scores plus a measured 
value of how far process data (new observations) lie from the 

5 model - If there is an auto-correlation structure in the 

scores , one- step- ahead forecasts of process scores ( t a ) and 
process variables (y or x) may be used for diagnosing and 
controlling the process . 

0 The algorithms for modelling the process are shown in the 
following step by step. 

It is assumed that suitable values of the parameters X± to X3 

are available, both as values of centering and scaling con- 
5 stants , EWMAk and €>k for the variables (xk, yk) • 

The EWM-PC algorithm 

1. Select suitable parameter values (Xj ,y) . 

0 

2 . Start with an initial matrix Yo, magnitude No x K. Set 
Y=Yq. The memory matrices Y* and Q are initialized as 

empty . 

5 3. The weights vi and v£* are calculated according to 

equation (1) with the parameters X and 7. The weighted 

mean value of each variable (k) is calculated from Y: 
EWMAk = XjVj/yik / XiVj.* 
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The scaling weights (4>k) are calculated from both Y and 
Y* (note that Y* is centered) : 

s k 2 = [Plivi (y ik - EWMA k ) 2 + d-PlEjVj *yjk* 2 3 / fPSiVi + 

(l-P)SjVj*] 

*k = 1 / s k 

The constants are left as zeros with zero weight. 
Important variables may be scaled up or down by multipli- 
cation of the scaling weights above by a suitable adjus- 
ter between, for example 0.3 and 3= The parameter (3 which 

determines the relative effect on the current data and 
reference data may lie somewhere between 0.1 and 0.9 
depending on the stability of the process. 



4. Center and scale Y with the centering parameters EWMAk 
and the scaling parameter <I>k. 

0 

Yik (normalized) = (yik (row) ~ EWMAk) **k 



5. The central part of the EWM-PC algorithm is initiated 
here; the determination of the weighted PC model . The 

5 additional steps caused by cross-validation are not 

explicitly shown; they substantially comprise elaboration 
of the algorithm below several times with different parts 
of data deleted and afterwards predicting the deleted 
data from the model. The model dimension, A, with the 

0 smallest prediction error (PRESS) is selected, with 

preference for a smaller A, if PRESS is largely the same 
for different model dimensions. 



(i) . Set dimension index a to one. 

(ii) As starting vectors for p a and q a (loading vector) , 
the ones from the previous time points are used. At 
the very first time , the last row in Yq is used, 
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normalized to length 1 . 

(iii) Calculate the scores ti a . To compensate for 

missing data, dummy variables (dik) are used, which 
are zero if element yik is missing, otherwise equal 
to one . 

tia = ^ikYikPka / 2k dikPka 2 



10 



If there is a reference matrix, Y* , the correspon- 
ding scoxres , 

tia , for this matrix are calculated 
by using djk and yjk* instead of dik and yik in 
the above equation . 



15 



iv) Calculate the loading vectors , pka r using the same 
dik for compensation of missing data. 



Pka = X-i d ikYiktia / Si d ik^ia 2 



20 



Normalize p a to length one; p a = p a / 



Pa 



25 



If there is a reference matrix, Y* , the correspon- 
ding loading scores, Pka*' for this matrix are cal 
culated by using d jk and yjk* and ti a * instead of 
dik. Yik and t i a in the above equation. 



Form p a as the weighted combination of two calcu 
lated p a values. 



30 



Pa = P Pa + (l-(J)Pa 



Normalize the new p a to length one. 



35 



(v) Check the converaence of 



p a,new p a,oid 



/ 



which must be smaller 



a, new 

than 10~6 to indicate convergence . If convergence 
exists, proceed with step (ix) , otherwise step { vi ) 
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(vi ) If the calculation is made at a first time, 
return to step (iii) . Otherwise, continue to 
step (vii) . 

5 (vii) Calculate the scores n a and xi a * for the loading 

matrix P and the reference matrix P* , respectively . 

u ia - 2Lk Pmem, a, ik Pka / 2Lk Pka 2 
10 Uja* = 2k Pref , a, jk Pka / Xk Pka 2 

( viii ) Calculate the ■ loadings " of the two loading and 
reference matrices according to: 

15 °3ka = 2-i Pmera, a, ik u ia / 2-i u ia 2 

Qka* = Pref, a, jk u ja* f £j (u ja* ) 2 

Form Q a as the weighted combination of two calcula- 
20 ted q a values. 

Qa = P Qa + (l~P)q a * 

Normalize this new qa value to length one. Use the 

2 5 weighted combination (weight a) of this vector and 

p a (weight 1-oc) and return to step (iii). 

(ix) If convergence exists, the final t a and t a * for the 
two data blocks are calculated, and from these 

3 0 temporary loading vectors which are used only to 

form the residuals to provide data in the next 
model dimension calculations. This is necessary to 
preserve orthogonality of the scores and is 
analogous to the orthogonalization step w t — » p 

3 5 in ordinary PLS regression. 

After this, the residuals Y - t a Pa ' and Y* - 

- t a *Pa~ are formed. Add one to the model dimen- 



BNSDOCiD. <WC_ 9518420A2_I_> 



WO 95/18420 



PCT/SE94/01208 



10 



20 



30 



sion (a = a + 1) and proceed with the next dimen- 
sion by using the residuals Y and Y* as the data 
matrices in this next dimension. 

(x) The algorithm is terminated when the number of 
dimensions , a, of the model equals the desired 
number of "significant " dimensions {variables ) , A, 
in the model , whi ch is determined by cross- 
validation, or based on experience . 



If instead an EWM-PLS algorithm is used, the difference 
between these is that the latter (PLS) includes both X 
blocks and Y blocks for data and reference data , respec- 
tively. By replacing Y by X and Y* by X* , loadings by PLS 
15 weights and p by w in the algorithm above , some sub-steps 

are added in step (iii) in the above algorithm. After 
calculating the scores t a and t a */ these are used for 
calculating Y weights , c a and c a * , respectively, which in 
turn leads to Y scores , here designated r a instead of u a . 



{iiia ) Y and Y* weights 



c ma = £i di m yim ti a / Xi di m ti a 2 



25 and analogously for c ma * . 



(nib) Scores t a and t a 



r ia = £m di m yim c m a / Xm dim c ma 2 



and analogously for r j a * • 



These scores , r and rj a * , are then used instead of t and 
t* , respectively, to calculate the PLS weights in step 



Finally, after convergence, the residuals (F a ) of each Y block 
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are formed by subtracting the relevant t vector multiplied by 
the relevant c vector. These residuals are then used as Y and 
Y* in the next dimension. 

5 After convergence of the above algorithm, the resulting scores 
(only t and u values ) are compared with maximum and minimum 
values with corresponding scores for the reference data and 
the loading matrices . Thus, when the reference matrices are 
initially empty, the data vectors corresponding to the 

10 , greatest and smallest t and u values for each model dimension 
are saved in the reference data matrix Y* and P*, respec- 
tively . The extreme scores are saved for later comparisons . in 
following updates, a score value which is below the minimum or 
above the maximum previous score with the same dimension means 

15 that the corresponding data vector is included in the 

reference matrix and a new score value is saved. Two variants 
of Y* may be noted, one where old data are deleted from Y* and 
where no exponential weighting of Y* is made, and another, 
recommended variant , where Y* is extended with the new data 

2 0 vector by using a slowly decreasing exponential weighting of 

Y* . The same principles are used for the reference matrices 
for loadings or PLS weights. 

The described algorithm forms the basis of how a multivariate 
25 process can be illustrated graphically, as mentioned above in 
the description of the invention. On the basis of observed 
facts, as, for example, because of the drift of some indi- 
vidual variable, the process may be restored to a normalized 
position by the fact that the variable in the process may be 

3 0 directly influenced. 

Physically, the process monitoring according to the invention 
is achieved by measuring the measured data of relevant quanti- 
ties by means of measuring devices for the respective physical 
35 quantity in the monitored process in a known manner . The 

measured values are passed via a process link to a computer, 
which is programmed to create models of the process according 
to the invention. The model or models are presented graphi - 
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cally on a screen, where according to the invention the 
process in its entirety is projected onto a plane or a hyper- 
plane and where the projection contains all relevant informa- 
tion about the process , which makes it possible for the opera - 
5 tor to take accurate action, based on facts , in the form of 
intervention in the physical quantities of the process , for 
example by adjusting the pressure or temperature levels, 
contact forces for rolls in a machine, etc., all according to 
which is indicated according to the visualization of the 
10 process . This type of information and the possibility of 

physical intervention in the process have not existed accor- 
ding to the prior art, since a real-time on-line study of the 
effect of many quantities on one another in a process has not 
been possible . 

15 

The calculations for the different steps to obtain the model 
of the process , referred to according to the invention, are 
implemented by calculating units , which are schematically 
reproduced in Figure 12 where a clear overview of the 
20 calculation steps is given by means of a flow chart . If, in 
step 2 in Figure 12 , t a , i+1 and/ or DModX end up outside the 
allowable control interval , different loading plots are used 
to identify which process variables (x^) have caused the 

process to leave its operative norm range, whereby the 
25 variable or variables which have caused the drift in the 

process are adjusted to values which are predicted to restore 
the process to a norm range as soon as possible . 
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CLAIMS 

1 . A method for monitoring and control of an industrial or 
other technical process, the course of which is dependent on a 
5 multiple of variables relevant to the process , comprising the 
steps of 

registering the measured values of the variables as an 
observation at a certain time, 
10 - describing the variables as a multidimensional room, 

wherein each variable represents one dimension in the 
room , 

representing each observation as a point in the multi- 
dimensional room, wherebv a series of observations 
15 carried out at different times will be represented by a 

point swarm in space, 

calculating at least one first !p a ) and one second (pb) 
principal direction in space for the point swarm , 
determining the projections of the observations onto the 
20 principal directions {p a , Pb) > 

inserting the projection of the observations onto a point 
as a point on an electronic screen in a linear room which 
is spanned by the principal directions (p a? pb) . 

2 5 characterized in that 

the principal directions (pa^Pb) of the point swarm in 
space are updated during the course of the process , which 
allows the process to be dynamically monitored, 
30 - the norm range of the process , as well as process- 
influencing variables, are indicated on the screen, 
the process -influencing variable or variables which 
indicate drift in the process are adjusted by the process 
operator or the monitoring member , whereby the process is 
restored to the norm range . 



5 



2 . A method according to claim 1 , characterized in that the 
process is illustrated dynamically, on-line, as a picture on a 
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screen, wherein a first (p a ) and a second (pb) principal 
direction define the plane shown on the screen, wherein the 
projections of the observations are illustrated as points on 
the screen, and wherein the process is visualized in the form 
5 of a snake which crawls over the screen from one point tc 
another point representing consecutive observations of the 
process in time. 

3. A method according to claim 1 , characterized in that the 
10 process is illustrated dynamically, on-line , as a picture on a 
screen, wherein a first, a second and a third principal direc- 
tion (Pa f Pb) define a three-dimensional room which is shown on 
the screen , wherein the projections of the obs erva t i ons are 
illustrated as points on the screen, and wherein the process 
15 is visualized in the form of a snake which crawls over the 

screen from one point to another point representing consecu- 
tive observations of the process in time . 

4 . A method according to claim 2 or 3 , characterized in that 
20 the process is illustrated dynamically, on-line , by deter- 
mining the deviations (the residuals ) between the calculated 
model of the process and the reference model as a quantity 

( Dmod ) and by indicating this distance of the process to the 
model (Dmod) by the snake changing colour in dependence on the 
2 5 distance to the model (Dmod) . 

5 . A method according to claim 4 , characterized in that all 
of or parts of the crawling snake representing the course of 
the process are coloured in various colours in dependence on 

30 the magnitude of Dmod, that is , a certain colour for the 
visualized parts of the process where the distance to the 
model is outside a certain norm range for Dmod and other 
colours for the visualized parts of the process where this is 
wi thin the norm range of the process. 

6 . A method according to claim 2 or 3 , characterized in that 
the snake initiates an alarm when the snake on the screen 
enters a range which lies outside a norm range for the 
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process defined on the screen in advance. 

7. A method according to claim 2 or 3 , characterized in that 
that part of the process which is of current interest is 

5 indicated as a more intensely or brightly represented snake. 

8. A method for monitoring and control of an industrial or 
other technical process, the course of which is dependent on a 
multiple of variables relevant to the process, comprising the 

10 steps of 

registering the measured values of the variables as an 
observation at a certain time, 

describing the variables as a multidimensional room, 
wherein each variable represents one dimension in the 
room, 

representing each observation as a point in the multi- 
dimensional room, whereby a series of observations 
carried out at different times will be represented by a 
point swarm in space, 

calculating at least one first (p a ) and one second (pb) 
principal direction in space for the point swarm , 
determining the projections of the observations onto the 
principal directions (pa* Pb) r whereby a model of the 
process is obtained , 

calculating the deviation ( Dmod ) between the calculated 
model and a reference model for the process . 

characterised in that 
30 

the principal directions (pa#Pb) of the point swarm in 
space are updated during the course of the process , 
whereby the calculated model of the process is dynami- 
cally adapted to the process in real time , 
35 - initiating an alarm when the distance of the process to 

the model exceeds a predetermined alarm limit . 

9 . A method according to claim 8 , characterized in that that 



15 



20 



25 
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the distance of the process to the model (Dmod) is shown on- 
line in a figure where said distance to the model (Dmod) is 
plotted as a function of the time of process observations 
made . 

5 

10 . A method according to claim 8, characterized in that the 
distance of the process to the model ( Dmod ) consists of the 
standard deviation for the deviation of the respective obser- 
vation from a reference model of the process. 

10 

11. A method according to claim 2 , characterized in that the 
relevant variables in the process are illustrated as a picture 
on a screen, a score reproduction , wherein a first {pa ) and a 
second (pt>) principal direction define the plane shown on the 

15 screen, wherein the positions of the individual variables are 
projected onto this plane, and the midpoint (x) of the point 
swarm in this plane is illustrated as points on the screen . 

12. A method according to claim 11, characterized in that 
20 the process , indicated in the form of a snake as well as a 

score plot , is at the same time visualized on the basis of the 
same plot plane defined by the same first (p a ) and second (pb) 
principal directions on separate or on the same screen image, 
whereby the movement of the snake over the screen can be 

25 immediately related to process -driving variables , in that the 
directions from the midpoint (x) of the point swarm in the two 
images correspond to each other , whereby the direction of 
crawling of the snake away from the midpoint (x) of the point 
swarm indicates an influence on the process by one or more 

30 variables which in the score plot lie in the same direction of 
travel as the movement of the snake, whi ch means that the 
process operator or the monitoring member receives information 
about which variable /variables result in drift in the process , 
allowing the operator or the monitoring member to easily act 

3 5 on the disturbing variable . 
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A PCA model is calculated on data for the 
period immediately preceding the present 
time, including 

a) number of components A 

b) alarm limit DModX C rit 

c) parameters P a ; a=1...A 

Alternatively, if on-line Y (responses) are 
available t calculate a PLS model in similar 
manner with 

a) number of components A 

b) alarm limit DModXcrit 

c) parameters P a ; a=1...A 
parameters w a ; a=1...A 
parameters c a ; a=1»*«A 



initialization of different memory matrices 
(Q, CT, Y*) 

On-line connection to the process and 
updating of the model 

1, Entry of new data vector from 
the process, X 

Centering of variables in X 
Scaling of variables in X 

2. Calculate scores t a , t+1 

residuals e t+1 
DModX 



3. Updating: 



- Centering of parameters 

- PCA model (PLS) 

- Memory matrices Q & Y 
(corresponding for PLS) 
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